Modeling Conditional Independence with Nonuniform Predictions
نویسندگان
چکیده
A statistical language model assigns probability to strings of arbitrary length. Unfortunately, it is not possible to gather reliable statistics on strings of arbitrary length from a nite corpus. Therefore, a statistical language model must decide that each symbol in a string depends on at most a small, nite number of other symbols in the string. In this abstract, we propose a new way to model conditional independence in Markov models. The central feature of our nonuniform Markov model is that it makes predictions of varying lengths using contexts of varying lengths. We believe that our work has two contributions to o er to the eld of Markov modeling. The rst contribution is our interpretation of the interpolation parameters of an interpolated Markov model (Jelinek & Mercer 1980) as beliefs about conditional independence. Prior work has interpreted the interpolation parameters as hidden state transitions between models of different orders, or as smoothing \speci c probabilities" with \general probabilities". Our interpretation gives rise to the second contribution of our work, namely, the use of nonuniform predictions. Nonuniform predictions is a principled way to perform alphabet extension, that is, to make a string become a symbol in the alphabet, an ad hoc technique that can improve model performance (Jeanrenaud et al. 1995). This paper consists of three sections. First, we provide three di erent generative intepretations for the interpolation parameters of a Markov model. These interpretations give rise to a state model, a context model, and our nouniform model. Next, we compare the ability of these three interpretations to model local independence and global independence. We argue that the nonuniform model combines the ability of the state model to properly model global independence with the ability of the context model to properly model local independence. Finally, we prove that the nonuniform interpretation is fundamentally di erent from either uniform interpretation because it is not possible to map a nonuniformmodel into an extensionally equivalent uniform model. The full paper (Ristad & Thomas 1996) provides e cient algorithms for evaluating the probability of a string according to a nonuniform model, for nding the most likely nonuniform generation path for a given string, and for optimizing the parameters of a nonuniform model on a training corpus. Our notation is as follows. Let A be a nite alphabet of distinct symbols, jAj = k, and let x 2 A denote an arbitrary string of length T over the alphabet A. Then xji denotes the substring of x T that begins at position i and ends at position j. For convenience, we abbreviate the unit length substring xi as xi and the length t pre x of x as x.
منابع مشابه
Nonuniform Markov models
We propose a new way to model conditional independence in Markov models. The central feature of our nonuniform Markov model is that it makes predictions of varying lengths using contexts of varying lengths. Experiments on the Wall Street Journal reveal that the nonuniform model performs slightly better than the classic interpolated Markov model of Jelinek and Mercer (1980). This result is somew...
متن کاملFinancial Risk Modeling with Markova Chain
Investors use different approaches to select optimal portfolio. so, Optimal investment choices according to return can be interpreted in different models. The traditional approach to allocate portfolio selection called a mean - variance explains. Another approach is Markov chain. Markov chain is a random process without memory. This means that the conditional probability distribution of the nex...
متن کاملMarginal conditional independence models with application to graphical modeling
Conditional independence models are defined by a set of conditional independence restrictions and play an important role in many statistical applications, especially, but not only, graphical modeling. In this paper we identify a subclass of these models which are hierarchical marginal log-linear, as defined by Bergsma and Rudas (2002a). Such models are smooth, which implies the applicability of...
متن کاملSpice Compatible Model for Multiple Coupled Nonuniform Transmission Lines Application in Transient Analysis of VLSI Circuits
An SPICE compatible model for multiple coupled nonuniform lossless transmission lines (TL's) is presented. The method of the modeling is based on the steplines approximation of the nonuniform TLs and quasi-TEM assumptions. Using steplines approximation the system of coupled nonuniform TLs is subdivided into arbitrary large number of coupled uniform lines (steplines) with different characteristi...
متن کاملMarkov Properties for Linear Causal Models with Correlated Errors Markov Properties for Linear Causal Models with Correlated Errors
A linear causal model with correlated errors, represented by a DAG with bi-directed edges, can be tested by the set of conditional independence relations implied by the model. A global Markov property specifies, by the d-separation criterion, the set of all conditional independence relations holding in any model associated with a graph. A local Markov property specifies a much smaller set of co...
متن کامل